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IMPROVED APPARATUS AND METHOD FOR 
MULTI-THREADED SIGNAL PROCESSING 

Field of Invention 

Invention relates to electronic data and signal processing, particularly to high- 
5 perfonnance multi-threaded infoimation processing techniques. 

Background of Invention 
Traditional methods for achieving high-performance in coiiq>utational systems 
for digital infoimation processing have centered around the design of architectures 
tiua deliver greater levels of parallelism. This is typically achieved via fhe design of 
10 processors and instruction-set architectures that allow for the exploitation of hardware 
parallelism and software concuirmcy. 

High-performance is typically drfined as the ablKty lo execute a very large 
number of operations pa- second. This figure of mart is strongly dqyoident on the 
type of operations, which ^ically dq)ends on the type of application targeted. 
15 Traditional design of higbfetfonnance information processing systems 

usually idies on principles of computer architecture to define several key attributes of 
the processing system: 

• Imtruction-set architecture refers to the actual programmer-visible sets of 
instructi(Mis» and serves as the boundary between hardware and software. 

20 • Orsmtizutfzm refers to high-l^^^ 

system^ bus structure, and internal CPU design. 

• Hardware refers to spedfic detailed logic design, chcuit implementation, and 
packaging. 
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In order to achieve higb-peifonnance, which is an attribute typically required 
in special-purpose processors (i.e., built for special applications), three approaches are 
taken: 

(1 ) Instraction-levd parallelism: this approach, which exploits 
parallelism in hardware, provides for parallel threads of processing via the 
use of a voy loi^ or vectorized instruction word« whose fields can be 
decomposed into concurrent processing threads. The mechanism to e}q>loit 
this parallelism may be realized via a scheduler, v4iich schedules 
opmtions onto one of several datapath processing units. This sdieme has 
many drawbacks, including the difficulty of building the scheduler and 
identifying oiough parallelism to adueve desired thiouglq)ut 

(2) Siq>ascalar techniques: this ^roach exploits fine-grain hi^y- 
pipelined, single-threaded processor aichitectures to adiieve higli 
performance. This scfaone may achieve v^ high perfomiance» but only 
for a small class of operations. For operations not well-matched to a 
particular datapath architecture, performance of superscalar design is 
reduced significantly. Thus, tibiesi^erscalaraiyproach is unsuitable for 
wide-ranging £q>plications with high signal^rocessing content 

(3) Monoiy hierarchy techniques: to hide latmcy of memory accesses 
to slower memories, memory hierarchy tedmiques have hem used 
extensively, especially in microprocessor designs, to increase overall 
system performance by intelligently using fast memories, t.e., caches. 
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between the processor units and slow^ memory efifectively to hide latency 
of slower memory. 
Conventionally, multi-processor systems may employ multi-threaded 
processing to inq)rove coRq>ute perfoimance. Multi-threading gently is a known 
5 approach for enhancing compute resource utility> and thus, overall processing 
performance. However, ordinary multi-threaded processing solutions are 
implemented using comptec distributed or netwoited computer nodes, which are 
often not easily reconfigurable at lower logic or circuit level, nor contemplated for 
addressing advanced functional problem sets, such as multi-mode tetecommunications 
10 aIg(Mrithmsornetworidngi»iQtocols. Accordingly, ihoe is a need for iniproved multi- 
thread processing soIutioiL 



SummaTV of invention 
Invention resides in design and implementation methodology, processor 

15 architecture, and system fa: processing multi-threaded digital information (signal or 
data representation) to improve functional perfoimance. Preferably, general system 
design or functioiial definition, algorithm, electnmic signal, or data file is provided 
initially to include odc or more nmlti-direaded rqnesenlation. Such initial prototype 
design or fimctian may ttten be profiled or otherwise characterized for parallel or 

20 eSfectivdy amilar processing, in particalar, in order fonctionally to use or otherwise 
be impkmoited in one or more omespcmding fixed, paiameterizable, programmable, 
or configurable logic unhs or other equivalent fimctional signal-processing kernel or 
element, using tenqxnal and/(»: non^empcwBl functional considerations. 
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Preferably, relatively complex system functionality, such as for application to 
digital oommunications and/or networictng and/or media processing system design, is 
analyzed according to pre-specified system design rules, mathematical operations, 
sequoices of operations, or parameters, and then symbolically or schematically 

5 represented to identic one or more algorithms, specific sequences of opmuions, 
patterns of memory accesses, or segments (Le., single or multi-**threads**), which may 
each be profiled, structured, or otherwise characterized for optimized operation or 
i^^>Ianentation using one or more particular fixed, parameterizable, piogvammable, 
or configurable lo^c unit or kernel elements. Such element is built by providing a 

10 datapath, whose structure and configurabili^ is determined via profilii^ a 

sequencei/fintte-state^machine, whose structure and configurability is determined via 
jwn fiiifig j and local memory, whose structure is determined via profiling memory 
accesses and using locality to derive local memory properties. Optionally, one or 
nune kernel demmts are implemented oitirely in software or programmable logic, or 

15 combination thereof. Fuitfaa:, as described heiein^temi "profiling" refers generally to 
aiitomitfed and/m^ manual processing of ok or more ^stem or function modules to 
define one or more configurable stroctures associated with each module. 

Prief pesqiptiQq pf PyawiflSf 
20 FIG. 1 is a general methodology and tool arcbitecnire diagram for 

implementing in software and/or hardware a piefixred enibodiment of the present 
invention. 

FlGs. 2A-B are functional block diagrams for implementing one aspect of the 
present inventiixi. 

4 
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FIG, 3 is a representative functional diagram illustrating heterogoieous aspect 

of the present invention. 

FIG. 4 is a representative junctional diagram illustrating reconfigurable aspect 

of the preset invention. 
5 FIG. S is a representative functional diagram illustradng konel aspect of the 

present invention, 

FIG. 6 is a representative functional diagram illustrating interface aspect of the 
present invention. 

FIG. 7 is a system methodology flow chart showing fimctional operations for 
to implemoiting we or more aspects of the present invention. 

FIG. 8 is rqnesentative of software code stubs for implementing one or more 
aspects of the present inv^tion. 

FIG. 9A-B are rqxresentative fimctional diagrams of one or more appUcaticms 
of present inventian. 

15 

Detailed Description of Prefened Embodiment 
Present innovation enables autcnnated design and inq)l«nentation to process 
single or multi-tfareaded or equivalently partitioned processing of digital data. 5ignals» 
or ftmctiraalrqiresaitadon for improved processing pedb Initially, system 

20 design or functional definition, algorithm, electronic signal, or data file provides 
certain single or multi-direaded representation, ivhomqwn one or more sysxem design 
or fimction modules are profiled, structured, or o&erwise diaracterized for parallel or 
concurrent processii^. 
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For example, multi-threaded prototype may be used or otherwise be 
implemented m fixed, parameterizable^ programmable, or configurable logic unit or 
other signal-piDcessing kernel or element. Hence, complex system functionality, such 
as digital communication^ networking, or multi-media application, may be analyzed 
5 pa: system design rules, mathematical operations, sequraces of operations, or 

parameters, then symbolically Cft schematically represented to identify certain single or 
multi-thread algontbms^ specific sequences of operations, patterns of memory 
accesses, or segments, each tfaxead being profiled or characterized to optimize 
operation or implementation using fixed, parametorizable, programmable, or 

10 configurable logic unit or kernel element 

Optionally, dat^atfa structure is configured into single or multi-thread 
elment, as detennined by profiling, a sequence- and/or equivalent finite-state- 
madune, whose stnicture and ccmfiguiability is deterasined by profiling, and k)cal 
memory, whose stnicture is detennined by profiling memory accesses and locality to 

15 derive memory properties. 

As used hoem, profiling tenninology is understood to refer generally to any 
computer-autamated and/or manual processing, interpretation, or classification of one 
or more system or fimction modules to define or categorize one or more configurable 
structures as^)dated with eadi Euidule, e.g., by selectmg or assigning 0^ 

20 fimctionai elements or design objects, such as interconnection, signals, logic, circuits, 
etc. Prefoably, profiling is accomplished according to one or more imviously and/or 
dynamically defined criteria or fimeticmat rule set 
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Generally, in a computer-automated and/or manual development approach, a 
single or multi-tbreaded design is processed by providing initially a first-level 
functional definition representing a prototype systwn, such that an other^level 
functional definition symboHcaUy representing equivalent functionality maybe 
generated or effectively profiled tfaerefiom. In this hieraFchical design scheme, the 
genmted symbolic represenution may identify certain threads associated with the 
system design, preferably at one or mxc functional levels* 

Each thread may be profiled for processing by conresponding kernel 
element(s), and one or more common set of operations is identified for given threads, 
(e.g^ m a 1-to-l, multiple-to-l, or l-to*muItipie thread-to-kemel relationship). Each 
thread may filidier be ma|q)ed to identify the sequofice, or scheduling for 
each set of operators utilized to implem^t syston or functional modules, such as a 
sequence of arithmetic operations, control operations, and/or memmy access 
cpefations or related memory locations. 

Hmce, using lb& present ^stem development mdfaodology, a multi-threaded 
prf>f^igsmg iwrJiitiytinng mgy Hiihstantiftlly include a set of kmiel eloaents, such tiiat 
one kernel element processes certain function represented by corresponding thread, 
and anofli^r keniel element in the same prototype design processes oth^ fimcti(m 
tqiresentedbyotiiercorre^xmdingflicead. b tins partitioned or distributed 
processing qqiroach, each thread may be profiled separately or ineraxducaliy for 
an»qxriate multi-level gt fimctional group processiqg. For exanq>le, a first-level or 
group konel element and a second-level or group kernel elemoit, respectivdy are 
associated witii a corresponding first thread and second thread in a given fimction or 
^stem design. 
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In aiepresentative system design for wireless code division multiple access 
(CDMA) communicatioiis application, it is contemplated ftat various kemels maybe 
provided to serve different functional groups, such as: ftont-end processing (e.g.. data 
5 switch selector, sample interpolation, etc.); chip-rate procesang (e.g., sample epoch 
selection, tnatched filter, gaieric de^rcader, generic dechannelizer, code genoration 
unit, mtegirate and dump, genmc searcher control, etc.); symbol sequence prooessmg 
(e.g., transport fonnal decoder, dynamic spreading fector cojzqputer, fast Hadamard 
transfoim, etc.); channel element processing (e.g., alignment/deskewing, combiner, 
10 soft decision con^ter, interpatti interference equalizer, receive antoma diversity 
combiner, etc.); interleaving (eg., deinterleavcr ccmtroUer); and channel coding (e.g-, 
tnibo decoder^ convohidcmal decoder, etc.). 

Generally, present approash enables one or more functional or system designs 
to be implemented efficiently, preferably via cunent muM-fhreading scheme, in a 
15 single {TOcessor ardntecture by re-param^oizing, rquogramming, or reconfiguring 
kemd elements (ie., as detennined by profiling technique as described further 
therdn,) fiom which corresponding direads are assembled, and/or by changing 
sequence of opoB&ms 0.e., as detennined by mappiog and/or «±eduling) with which 
threads are inqikmoited. Ftefisned onbodinmit implements functional or system 
20 design in one or more het^geneous and reconfigorable logic or kernel elements 
acconling to so-called '1>1U/' process, as desmbed further h^^ 

FIG, 1 is a geaieral architecture or systmn block diagram showing top-level 
overview of pnssent design methodology, functional modules, and software and/or 
haidware tool architecture, preferably implemented in one or more electronic design 

8 
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automation platfonns, including one or more stand-alone or networked computers, 
piocessors, engmeering workstations, or other compute facility having appropriate 
operating system, user interface* storage management, communications interfaces, and 
other computer-aided design and engineering tools. Preferably, it is contemplated that 

5 present design mediodology serves to provide a tool architecture and processor 
implementation and architecture, or data file rq)resentative thereof, for cabling 
system architecture, such as network implementation. 

As shown, initially one or more fimctional definiticHi files 1 0, such as design 
netlist, or high-level description language (such as C or HDL) defining one or more 

10 functional modules or algorithms 12 is provided manually or computed automatically. 
In accordance widi one aspect of fmsent implementation, functionally^selective 
]Kofiling and mapping sdieme 14 is processed or q>plied to primitives 16 and 
functional definitions 10 to generate or provide, particularly on a multi-threaded basis, 
one or more control and comnnmicadon signals 26 and kernels 1 8. Further, profiling 

15 and mapping 14 provides scheduling data for schedule operation tables 20. Control 
and c(»nmumcation signals are processed according to one or more predefined 
selected functional rule set or signaling flags, e.g., communication semi^hores 24, 
Various konels 18 are processed and interconnected for inqilementation 22, for 
example, tnieconfigarable fonn as described herein for multi-threaded signal 

20 processing. 

FIGs. 2A-B functional block diagrams show represoitative s^ of temels 18, 
28 and thdr physical implementation, inchiding schedule and allocate fimction 30. 
Pidersibly, one or more kernel 1 8 is associated with or corresponds to profiled and 
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mapped thread, and is implemented reconfigurably using sequencer 32, datapath 34, 
and memory 36. 

Hence, according to present system and circuit design methodology and/or 
computing apparatus, general functional definition is implementable using single or 
multi-threaded rqwesentation thereof, which may be profiled cflfectively for paraUel 
processing using one or more corresponding kernel logic elements (e.g., according to 
l-to-multi» 1-to-l, ouilti-to-1 ormulti-to-nuilti kernel to thread relationship.) For 
example, communicadoD, networking, or media processing fimctionaltty or algorithm 
is functionally analyzed and symbolically represented to identify one or more thread 
segmoits, which are each profiled or otherwise characterized for optimized operation 
or implementation using one or more particularly designated fixed, param^crizable, 
programmable, or leconfigurable logic kemeL 

FIG. 3 functional diagram shows r^resentative heterogeneous, reconfigurable, 
multi<*proces5ing arrangement, for example, whereupon kernel 8 may implraient 
""smair granularity threaded function, and kemel 6 tmy implement ''large*' granularity 
tfireaded function, hi diisrecoiifigurabte arrangement, various levels of fimctional 
granularity, which is preferably an attribute of design function and cone^ndixig 
kernel, may be inq)lemented or dynamically reconfigured according to de^gn 
lequiremeat or profile ms^ing preference. 

For further illustration, FIG. 4 functional diagram shows one or more 
representative or available omfigurable logic or fimctions v/bich may be employed 
according to present approach for implementing single or multi-threads into 
designated kernels, such as reconfigurable logic or piogrammable function units 
(PFU) 40 having programmable logjc elements and switch matrix (e.g., for mco^g 

10 
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biMevel operationsX reconfigurable datapaths 42 having multiplex^, registerSp 
adders, bufifers, etc. and configurable signal flow through these elements (e.g., for 
dedicated dattipgtfb filters)^ reconfigarable arithmetic 44 having address generators^ 
memoTfy memoiy address control, etc. (e.g., for arithmetic convolution kernels), and 
reconfigurable control 46 having data memory* datapath, program memory, 
instruction decoder and controller, etc. (e.g., for real-time operatixiig system process 
management). 

Moreover, as fiuther iUustradon of sanQ}le kernel implementation, FIG. 5 
functional diagram shows preferred functional elements [or implementing kmel IS, 
including data sequencer 32, data memoiy 36, and paiameterizable configurable 
aritfmetic logic unit (ALU) 34* 

PIG. 6 is a represmtative fimctional diagram ilhistrating optional interface 
b^en dynamically reconfigurable logic (DRL) process 64 and associated 
configuration database for processing functions extonally to main processor hardware 
model SO. Preferably, DKL process is heterog^eous and reconfigurable, and 
implemoted using cmient innovation. As shown, hardware interfaces 54 couples 
processor dement 52 associated with library 62 and specified functional modules 60, 
including proces s or software model 57 having C-program model 56 and input/ou^ut 
device drivers 58 to extenial DRL process 64. 

Intius optional embodimmt, one or moi^ smg^e or muld-threaded digital 
infmnation (e.g., signal or data lepresentatiraX such as general system design or 
fimctional definition, algorithm, electronic signal or data file is provided initially to 
include one or more multi-threaded representation, and such initial prototype design 
or fimcdcHX is profiled or otherwise characterized for paralld or effectively similar 

It 
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piocessing, in particular, in order functionally to use or otherwise be implemented in 
one or more corresponding fixed, parametoizable, programmable, or configurable 
logic unit or other equivalent fimctional signal-processing kernel or elment in 
processor model SO, 37 for fiinctional cooperation or emulated real-time signal 
interaction with external DRL process 64, 

FIG. 7 flow chart shows anottier aspect of present operational steps. Initially, 
user-generated or computer-generated fimctions are defined 70 for prototype or other 
system deagn- Then, one <»:moie mathematical analysis or design performance 
optimization scheme may be f^plied 72 to initial design definition. Next, one or more 
constituent algorithms for design definition is provided 74, and r^resentation of such 
algorithms is thereby coded 76, preferably m high-level, register transfer, cm* 
bdiiavioral fimctional format 

Algorithms may be profiled and mapped 78, or otherwise fimctimially ddined 
or cate^nized mannally and/or anlamalicaUy ibr optimized or directed operation or 
implemoLtation of system design modules, functions, signals^ components, or other 
element thereof using cone^ndingly defined kernels 80, pieferabtyuajag one or 
moie specified design buiIdiiig-bIock5,ie.,priniitives 86. ProfiUng and nuqn>ing data 
also are provided for conmranications semq[>hores 84 and scheduling and finite state 
machine ocmtrol and parameters 88. Then, kernel definition 80 and FSM control 
parameterization and scheduling 88, as well as communications semaphores 84 are 
qiplied to inclement single or mufti-threaded elements of present design into 
pnicessor architecture with leconfigurablekemdelemaits 82. FIG. 8 shows 
representative software code of san^le design indicating usage of multi«thread kernels 
90. 

12 
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In accordance mth one aspect of present invention, profiling pzocessing or 
reconfiguiable algorithms representative thereof i$ temporal, thereby including 
determination of certain time value or degree of change over time. Example of 

3 tOTporal Implication includes changes in receiver algorithms required in a cellular 
wireless system and any associated signal processing scheme for these algoriduns 
whidi can take advantage of present profiling methodology. In this example, 
whereiq>on procesang througlqiut requirements in one path (e.g.> reception direction) 
may increase or decrease as processing progresses (e.g., firom antoma to final 

10 retrieved data rq)resentation,) presoit profiling scheme serves to determine hardware- 
software or other functional pardtioning of overall design implementationu 

Further, in such cdlular wireless example, it is contemplated that multiple 
mefliods may perform similar or equivalent signal processing, but result in dififerent 
air-interfim requirements or elective fuiictiQiiality^ Particularly in die hardware 

15 partition of a given system, various fHocessing forms or fimctional elem^ots may 
occur or operate at various rates. Because variable processmg rates msQr be required, 
and various modes of operational control may be dictated by support for multiple 
ITOcesrang streams, sevoal additional non-tmporal and temporal profiling techniques 
maybe qiplied to provide (q>tiinal fimctional fl^ibility in view of available 

20 operational perGoimance point or cq>acity of such hardware architecture (e.g., real- 
time and noli-real-time profiling), ft is contemplated generally h^ein that other 
examples of applicatiim of present innovation may arise additionally with cellular 
wireless, including fixed-wireless, unlicensed wireless LANs, ccHdless telephony, 
tdemeny, and the lite. 

13 
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One piofiling technique applies to hardware-based algorithms across multiple 
modes of operation to detemiine type and number of operations and storage elements 
required, ther^y enffb^tpg designer to classify each t^porally-distinct function in a 
form whidi facilitates idoitification of commonly-used resources. 

Another profiling technique applies for controlling multiple levels of hardware 
definition according to frequency of diange, which is required. Here, mode- 
dq>endent changes in receive path of wireless receiver, for exaiiq>le, may need to 
change at startup for global reconfiguration between transaction configuration (e.g.» 
v/bset transactions are multi-second transactions), and within sub-second transaction 
across blocks of data (e.g., *V)n the fly.'O 

Depending on profiling results, appropriate level of configurable 
implemeotationniay be selected, sudi as forprocessiiigdataat highest data rate 
needing control on per-cycle basis. However, flexibility may be required for control, 
and progranunahie stete madiine msy provide optimal flexibihty meeting necessary 
peiformancerequiremrats. For a datapath i^cb may need to be selected at 
ccHifiguratian time, but is not changed often, then programmable interconnect may be 
appropriately ^Ued. 

Mcnover, if datapafli selection occurs real-time, then datapadi-cell-based 
multiplexing structure may qpply* Also, f(» control functions where operation 
<»deiing is necessary, then parameterized kernels f<x processing operations may iqqily. 
Additionally, in cases of hi^-perfi>imance requirements and low flexibility 
requirements, dedicated datapaths are ^licable to optimize silicon implementation. 
In case of multi-standard wireless receiver design, which delivers optimal flextbiHty 

14 
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relative to performance point, one or more of foregoing profiling techniques are 
applicable 

FIG. 9A Aows general aspects of applying present inventioii, including flow 
for transferring configuration table 92 of capability, parameters and values according 
to one or more industry or proprietaiy standards through plications programming 
inter&ce (API) 94 to provide one or more configuration paramelars for single or 
multi-thjeaded reconfigurable syston implemoitation according to present scheme, 
e.g.» using wired and/or over-the-air wireless network download or otibier 
transmissionfrecq)tiaQ. 

Preferred implonentation receives configuration parameters through API 94 to 
define or implement one or more intercomiected blodc modules 96» representing 
m]m)processor, digitid dgnal processor (DSP), application specific integrated circuit 
(ASIC), field prognamnable gate anay (FFGA),DIUU or odiorfimctional block 
module, which finther may be defined or implonen^ in one or more interconnected 
kernel dem^its 98. In accordance with one a^>ect of present invention, one or mw 
configurable parameters 1 00 may be defined or implemented to coireq3Qnd in 
threaded fashion to one or more specified temel elemoits. Hence, in this 
coafie^uable-paiameter case, design ar^l implementation mettod or system serves to 
I»ocess muhi'-direaded digital signal or data for improved fimcttonal performance* 

Generally^ system design or fimctional definition, algorithm^ electronic signal 
or data file is provided to include sndi multi4hieaded representation, and initial 
prototype fimcdonistfausfKDfiled for parallel processing by <Hie or more duead, for 
exarr^le, to implement certain parameterizaible kernel elements, which may be 
constrained temporally. 
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Moie particularly^ in digital wireless communication application, as shown in 
Fia 9B, portable mobile radio handsets 102 transmit and receive signals wirelessly 
with base station 1 04^ possibly coupled to other handsets 102 and base stations 104 
tfaroiigli digital n^oik 106. In this netwoiked q;iplication, specified design niles» 
operations, or parameters, as well as any symbolic or schematic lepres^tation thoeof 
identify or coirespond to multi-threads, for profiling and implementation in 
progiammable kernels or software modules. 

Optionally, k^nel elements may be configured for oparation in base station 
104 and/or handsel units 102. In particular, kernels may be configured for profiled 
datapath, sequeocer/finite-state-machine, memory, or other logical structure, possibly 
according to temporal or non-teoqporal design constraint 

Foregoing described embodimoits of die invention are provided as 
iUustcations and descriptions. They aie not intended to limit die invention U> precise 
fonn described. 

In particular^ Af^licant contemplates Aat functional implementation of 
invention described herein may be implemented equivalenily in hardware, software, 
finnwarg^ gprf/or other available fimctional components or building blocto. Other 
variations and embodiments are posribte in h^ of above teachmg^a^ 
intended that the scope of invention not be limited by this Detailed Description, but 
laih^by Qaims following. 
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Claims 

What is claimed is: 

1. In a computer-assisted design system, an automated method for processing 
multi-threaded system functionality^ the method comprising the steps of: 

providing a first function definition representing a system design; 

generating fiom the first function definition a second function definition 
representing symbolically the first function definition, such symbolic representation 
identifying one or more Aread associated witfi the system design; and 

profiling each thread for processing by a q[>ecified kernel element or set 
thereof 



2. The mettiod of Claim 1 further comprising the steps of: 
identi^ping a common sequence of 0|ia:ati<xi5 in a given thread; and 
associating (he common sequence of operaticms with a set of operators. 

3. ThemelliodofClaim2fiirtfa^conqffisingthestq>o£ 
assodating the set of operators with a sequence of arithmetic operations. 

4. The method of Claim 2 fiirdto* comprising the step of: 
associating the set of operators with a sequence of control opmtions. 



5. ThemethodofClaim2furtl^coEaprisingtfaestq9 0f: 
associating the set of operators with a sequence of memoiy access operations 
or locations. 
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6. The method of Claim 1 ulierein: 

one or more threads is profiled according to a temporal function. 

5 7. Appamtus for multi-threaded pnx^ssingccHnprising; 

a first kernel element; and a second kernel element; 

wherein the first kernel elanewt processes a first fimction represented by a first 
tiiiead, the second kernel element processes a second fimction represented by a second 
Arcad, the first thread and the secmd thread each being profiled for processing 
10 respectively by the first kernel element and the second kernel element, and the first 
tfiread and die &ec<md fliread bemg associated with a cammcm fimction. 

8. TheiqqiaratQsofClaimTwfaerrai: 

a coimiKm sequence of operations is identifiable with a given thread, 
15 the common sequence ofoperations being associated with a s€t of operators, 

9. The apparatus of Qaim 8 wherem: 

the set of cpeiatDrs is associated with a sequence of arithmetic, control, or 
memory access operations. 

20 

10. The apparatus of Oaini 7 whexidn: 

the first or second thread is profiled according to a temp(H»l constraiix^ 

1 1 . The apparatus of Claim 7 whmin: 

IS 
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the first and second kernel elements are implemented as one or more executable 
software modules. 



12. The apparatus of Claim 7 wherein: 

5 the first and second kernel elements are implemented as one or more 

functional modules in a fixed base station or a mobile handset of a radio 
communication ^stem. 

1 3. In a communication system comprising a base station and one or more 

10 portable units^ wherein each portable unit mdcy oomnnmicate wirelessly through radio 
signals with tte base station, a meflK)d for signal processing ccmiprising the &ep of: 

generating by a base station a first signal representing a system configuration, 
the first signal representing symbolically one or more fimction definition associated 
with one or more thread in fhe system ccmfigurafion, wherein each flvead is 

15 for processing by a q)ecified kernel elment in a portable imit 

14. The method of Claim 13 fiirOier comprising the stq> of: 

xeceiving the first signal by the portable unit, one or more kamel element in 
die portsdile unit being configured to process one or more thread in the ^f'stcm 
20 design according to the first signal. 

15. The method of Claim 13 wherein: 

one or more thread is profiled according to a tenq>oral functional constraint 
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